Planning and Learning with Adaptive Lookahead
نویسندگان
چکیده
Some of the most powerful reinforcement learning frameworks use planning for action selection. Interestingly, their horizon is either fixed or determined arbitrarily by state visitation history. Here, we expand beyond naive and propose a theoretically justified strategy adaptive selection as function state-dependent value estimate. We two variants lookahead analyze trade-off between iteration count computational complexity per iteration. then devise corresponding deep Q-network algorithm with an tree search horizon. separate estimation depth to compensate off-policy discrepancy depths. Lastly, demonstrate efficacy our method in maze environment Atari.
منابع مشابه
Information-Lookahead Planning for AUV Mapping
Exploration for robotic mapping is typically handled using greedy entropy reduction. Here we show how to apply information lookahead planning to a challenging instance of this problem in which an Autonomous Underwater Vehicle (AUV) maps hydrothermal vents. Given a simulation of vent behaviour we derive an observation function to turn the planning for mapping problem into a POMDP. We test a vari...
متن کاملA Lookahead Strategy for Heuristic Search Planning
Relaxed plans are used in the heuristic search planner FF for computing a numerical heuristic and extracting helpful actions. We present a novel way for extracting information from the relaxed plan and for dealing with helpful actions, by considering the high quality of the relaxed plans in numerous domains. For each evaluated state, we employ actions from these plans in order to find the begin...
متن کاملMachine Learning for Adaptive Planning
This chapter is concerned with the enhancement of planning systems using techniques from Machine Learning in order to automatically configure their planning parameters according to the morphology of the problem in hand. It presents two different adaptive systems that set the planning parameters of a highly adjustable planner based on measurable characteristics of the problem instance. The plann...
متن کاملLearning Rules for Adaptive Planning
This paper presents a novel idea, which combines Planning, Machine Learning and Knowledge-Based techniques. It is concerned with the development of an adaptive planning system that can fine-tune its planning parameters based on the values of specific measurable characteristics of the given planning problem. Adaptation is guided by a rule-based system, whose knowledge has been acquired through m...
متن کاملA neuromorphic model of spatial lookahead planning
In order to create spatial plans in a complex and changing world, organisms need to rapidly adapt to novel configurations of obstacles that impede simple routes to goal acquisition. Some animals can mentally create successful multistep spatial plans in new visuo-spatial layouts that preclude direct, one-segment routes to goal acquisition. Lookahead multistep plans can, moreover, be fully develo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2023
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v37i8.26149